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DEsrrRTPTT(;%]<f 



gackgronnd of th^ Tn^^°nM-n 

The present invention relates to novel thermo- 
stable DNA polymerases, the genes and vectors encoding 
them and their use in DNA sequencing. 
5 US Patents 4,889.818 and 5,079,352 describe the 

isolation and expression of a DNA polymerase known as 
Taq DNA Polymerase (hereinafter referred to as Taq) . it 
is reported that amino- terminal deletions wherein 
approximately one-third of the coding sequence is absent 
10 have resulted in producing a gene product that is quite 
active in polymerase assays. Tag is described as being 
of use in PGR (polymerase chain reaction) . 

US Patent No. 5,075,216 describes the use of Taq in 
DNA sequencing. 

15 International patent application WO 92/06/06188 

describes a DNA polymerase having an identical amino 
acid sequence to Taq except that it lacks the N-terminal 
235 amino acids of Taq and its use in sequencing. This 
DNA polymerase is known as A Taq. 
20 US Patent 4,795,699 describes the use of T7 type 

DNA polymerases (T7) in DNA sequencing. These are of 
great use in DNA sequencing in that they incorporate 
dideoxy nucleoside triphosphates (NTPs) with an 
efficiency comparable to the incorporation of deoxy 
25 NTPs; other polymerases incorporate dideoxy NTPs far 
less efficiently which requires comparatively large 
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quantities of th.se to be present in sequencing 

reactions . 

At the DOE contractor-Grantee Workshop (Nov. 13-17 
Santa Fe, and the I. Robert Lehman Syn,posiu. (Nov' 
11-14, 1994, Sonon.a), Prof. s. Tabor identified a site 
- DNA polymerases that can be modified to incorporate 
dxdeoxy NTPs .ore efficiently. He reported that the 
presence or absence of a sin.le hydroxy group (tyrosine 
vs Phenylalanine, at a highly conserved position on . 
con, BNA Pol^erase 1, X7, and Ta, makes more than a ' 
1000-fold difference in their ability to discriminate 
agaxnst dideoxy NTPs. (See also European Patent 
Applzcation 94203433.1 published May 31, 1995, 

Publication No. 0 655 506 ai u . 

_ , hereby incorporated by 

i5 reference herein.) 

gvtnimflrv of t-h^ T^y^ntinn 

The present invention provides a DNA polymerase 
having an amino acid sequence differentiated from Taq in 
that it lacks the N-ter.inal 272 a.ino acids and has the 
Phenylalanine at position 667 (of native Taq, replaced 
by tyrosine. Preferably, the DNA polymerase has 
n>ethionine at position l (equivalent to position 272 of 
Taq, (hereinafter referred to as Fy2, The full DNA 
sequence is given as Fig 1 (SEQ. id. NO. 1, . in^i.^,^ 
25 within the scope of the present invention are DNA 

polymerases having substantially identical amino acid 
sequences to the above which retain thermostability and 
efficxent incorporation of dideoxy NTPs. 
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20 



By a substantially identical amino acid sequence is 
meant a sequence which contains 540 to 582 amino acids 
that .ay have conse^ative an^no acid changes compared 
«xth Taq which do not significantly influence 
5 thermostability or nucleotide incorporation, i.e. other 
than the phenylalanine to tyrosine conversion. Such 
changes include substitution of like charged amino acids 
for one another, or amino acids with small side chains 
for other small side chains, ala for val. More 

10 drastic changes may be introduced at noncritical regions 
where little or no effect on polymerase activity is 
observed by such a change. 

The invention also features DNA polymerases that 
lack between 251 and 293 (preferably 271 or 272) of the 
N-terminal amino acids of rher^us fiavus (Tfl) and have 
the phenylalanine at position SS6 (of native Tfl) 
replaced by tyrosine; and those that lack between 253 
and 295 (preferably 274) of the N-terminal amino acids 
of mem,us ther^ophilus (Tth) and have the phenylalanine 
at position 669 (of native Tth) replaced by tyrosine. 

By efficient incorporation of dideoxy NTPs is meant 
the ability of a polymerase to show little, if any. 
discrimination in the incorporation of ddNTPs when' 
compared with dNTPs. Suitably efficient discrimination 
IS less than l.-io and preferably less than 1:5. Such 
discrimination can be measured by procedures known in 
the art. 

one preferred substantially identical amino acid 
sequence to that given above is that which contains 562 
amino acids having methionine at position l and alanine 



4 

at position a ,oor.espo„ai„g 

"«ive Ta„ ,.e.ein,ner «fe„ea .o .r^, 

™* -^e„ce 13 ,i.,„ „ .1 

invention. ^ present 



'I'- purified ENA polymerases m ,„a pv3 h .k 
■ » polypeptide band on SDS '""^ ™ both give 

" position „bil £o or tyrosine 

' wnicn forms several t^^i, 
10 siniiiar size on SDS noi Polypeptide bands of 

on SDS polyacrylamide gels 
A second preferred substantial! 
acid sequence is . ^^^"txally identical amino 

<juence IS that which lacks 274 of the r. ^ • 
amino acids of Th^r-™ N- terminal 

position 1, and the^phen^^ "-ethionrne at 

" at position 39. (J/ " '° 

^-ein:fterrrt? - 

~eis,i.enas : 3 ;::Tr '""^^^^ 

y- ^ (SEQ. ID. NO. 14). 
The present invention also provides a a. 

^0 - ~ To:::::: 

- --d ^r-rr 

- :e~ :r"^- - - -ntion, 

" = '7~.^c.sitio:::::::r;::;°^"'°" 

J-n a further asDect m.^ 
^ ^eoto. oontainin, V.^— tt^: " 
, •ct.vity o, the present invention ' e 

a"lno acid se^enoe difierentiated r" ' 

° that it iacs the »-ter.inaT a ""'^ 

rmmal 272 ammo acids and has 
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phenylalanine at position 396 (equivalent to position 
667 of Tag) replaced by tyrosine or a substantially 
identical amino acid sequence thereto. 

In a yet further aspect, the present invention 
5 provides a host cell comprising a vector containing the 
gene encoding the DNA polymerase activity of the present 
invention/ g.g. , encoding an amino acid sequence 
differentiated from native Taq in that it lacks the N- 
terminal 272 amino acids and has phenylalanine at 
10 position 396 (equivalent to position 667 of native Taq) 
replaced by tyrosine or a substantially identical amino 
acid sequence thereto.. 

The DNA polymerases of the present invention are 
preferably in a purified form. By purified form is 
15 meant that the DNA polymerase is isolated from a 

majority of host cell proteins normally associated with 
it; preferably the polymerase is at least 10% (w/w) of 
the protein of a preparation, even more preferably it is 
provided as a homogeneous preparation, e.g. . a 
20 homogeneous solution. Preferably the DNA polymerase is 
a single polypeptide on an SDS polyacrylamide gel. 

The DNA polymerases of the present invention are 
suitably used in sequencing, preferably in combination 
with a pyrophosphatase. Accordingly, the present 
25 invention provides a composition which comprises a DNA 
polymerase of the present invention in combination with 
a pyrophosphatase, preferably a thermostable 
pyrophosphatase such as Thermoplasma acidophilum 
pyrophosphatase. (Schafer, G.- and Richter, O.H. (1992) 

30 gur, J, Bipgh em. 2111. 351-355) . 
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The DNA polymerases of the present invention can be 
constructed using standard techniques. By way of 
example, mutagenic PGR primers can be designed to 
incorporate the desired Phe to Tyr amino acid, change (FY 
5 mutation) in one primer. In our hands these primers 

also carried restriction sites that are found internally 
in the sequence of the Taq polymerase gene clone of 
Delta Tag, pWB253, which was used by us as template DNA. 
However, the same PGR product can be' generated with this 
10 primer pair from any clone of Taq or with genomic DNA 
isolated directly from Thermus aquaticus. The PGR 
product encoding only part of the gene is then digested 
with the appropriate restriction enzymes and used as a 
replacement sequence for the clone of Delta Tag digested 
15 with the same restriction enzymes. In our hands the 

resulting plasmid was designated pWB253Y. The presence 
of the mutation can be verified by DNA sequencing of the 
amplified region of the gene. 

Further primers can be prepared that encode for a 
20 methionine residue at the N-terminus that is not found 
at the corresponding position of Tag, the sequence 
continuing with amino acid residue 273. These primers 
can be used with a suitable plasmid, e.g. , pWB253Y DNA, 
as a template for an^lif ication and the amplified gene 
25 inserted into a vector, e.g,, pRE2, to create a gene, 
Q^gt / PRE273Y, encoding the polymerase {FY2) . The 
entire gene can be verified by DNA sequencing. 

Improved expression of the DNA polymerases of the 
present invention in the pRE2 expression vector was 
30 obtained by creating further genes, pREFY2pref (encoding 
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a protein identical to FY2) and pREFY3 encoding FY3 . A 
• mutagenic PCR primer was used to introduce silent codon 
changes {i.e., the amino acid encoded is not changed) at 
the amino terminus of the protein which did not affect 
5 the sequence of the polypeptide. These changes led to 
increased production of FY2 polymerase. FY3 was 
designed to promote increased translation efficiency in 
vivo. In addition to the silent codon changes 
introduced in pREFY2pref , a GCT codon was added in the 
10 second position (SEQ. ID, NO. 2), as occurs frequently 
in strongly expressed genes in E. coli. This adds an 
amino acid to the sequence of FY2, and hence the protein 
was given its own designation FY3 . Both constructs 
produce more enzyme than pRE273Y. 
15 Silent codon changes such as the following increase 

protein production in E. coli: 
substitution of the codon GAG for GAA; 

substitution of the codon AGG, AGA, CG6 or CGA for CGT or 
CGC; 

20 substitution of the codon CTT, CTC, CTA, TTG or TTA for CTG; 
substitution of the codon ATA for ATT or ATC; 
sxibstitution of the codon GGG or GGA for GGT or GGC. 

The present invention also provides a method for 
determining the nucleotide base sequence of a DNA 

25 molecule. The method includes providing a DNA molecule 
annealed with a primer molecule able to hybridize to the 
DNA molecule; and incubating the annealed molecules in a 
vessel containing at least one deoxynucleotide 
triphosphate, and a DNA polymerase of the present 

3 0 invention. Also provided is at least one DNA synthesis 
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terminating agent which terminates DNA synthesis at a 
specific nucleotide base. The method further includes 
separating the DNA products of the incubating reaction 
according to size, whereby at least a part of the 
5 nucleotide base sequence of the DNA molecule can be 
determined. 

In preferred embodiments, the sequencing is 
performed at a temperature above 50**C, 60**C, or 70**C. 

In other preferred embodiments, the DNA polymerase 
10 has less than 1000, 250, 100, 50, 10 or even 2 units of 
exonuclease activity per mg of polymerase (measured by 
standard procedure, see below) and is able to utilize 
primers having only 4, 6 or 10 bases; and the 
concentration of all four deoxynucleoside triphosphates 
15 at the start of the incubating step is sufficient to 

allow DNA synthesis to continue until terminated by the. 
agent, e.g. , a ddNTP. 

For cycle sequencing, the DNA polymerases of the 
present invention make it possible to use significantly 
20 lower amounts of dideoxynucleotides compared to 
naturally occurring enzymes. That is, the method 
involves providing an excess amount of deoxynucleo tides 
to all four dideoxynucleotides in a cycle sequencing 
reaction, and performing the cycle sequencing reaction. 
25 Preferably, more than 2, 5, 10 or even 100 fold 

excess of a dNTP is provided to the corresponding ddNTP. 

In a related aspect, the invention features a kit 
or solution for DNA sequencing including a DNA 
polymerase of the present invention and a reagent 
30 necessary for the sequencing such as dITP, deaza GTP, a 
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chain terminating agent such as a ddNTP, and a 
manganese-containing solution or powder and optionally a 
pyrophosphatase. 

in another aspect, the invention features a method 
5 for providing a DNA polymerase of the present invention 
by providing a nucleic acid sequence encoding the 
modified DNA polymerase, expressing the nucleic acid 
within a host cell, and purifying the DNA polymerase 
from the host cell . 

10 In another related aspect, the invention features a 

method for sequencing a strand of DNA essentially as 
described above with one or more (preferably 2, 3 or 4) 
deoxyribonucleoside triphosphates, a DNA polymerase of 
the present invention, and a first chain terminating 
15 agent. The DNA polymerase causes the primer to be 

elongated to form a first series of first DNA products 
differing in the length of the elongated primer, each 
first DNA product having a chain terminating agent at 
its elongated end, and the number of molecules of each 
20 first DNA products being approximately the same for 

substantially all DNA products differing in length by no 
more than 20 bases. The method also features providing 
a second chain terminating agent in the hybridized 
mixture at a concentration different from the first 
25 chain terminating agent, wherein the DNA polymerase 
causes production of a second series of second DNA 
products differing in the length of the elongated 
primer, with each second DNA product having the second 
chain terminating agent at its elongated end. The 
30 number of molecules of each second DNA product is 
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20 



25 



approximately the same for substantially all . . 

products differing in lenc^th f '"''^ 

t- on . ^ ^""^ °ther by from i 

to 20 bases, and is distinctly differen^ . 

Of molecules of all the f • ^^^^ "umber 

length d.ffer.ng by no more than 20 bases from that of 
said second dna products. °' 
in preferred embodiments, three or four such ■ 
te^inating agents can be used to ma.e diffL 
products and the sequence reaction is provided .th a 
T~ ion, or even a manganese or iron ion \^ 
a oncentration between COS and loO ^, p^eferab^' 
--ween i and 10 ^) ^^^^^^^^ 

~ng to molecular weight in four or less lane^ oTa 
' In another related aspect • 

olrgonucleotide primer, a nucleic acid ^ k 

"c^ieic acad to be sequenced 

'=-"-""3 agents in ai«ere„t .»ou„.., 

ch.„ te„„ati„g agent .ay be a dideo^ynucleotide 
te™. , ^^^^^^^ ^^^^^^ 

The ^t.„a ^„...e. i„.i„,,. ,,,,,,,, 

fra,™.„.3 »y .i.e a„a aete„i„i„, .,e nucleic acia 

sequence. The agents are differentia^.^ 

w . ^-^J-terenCiated from each othf^r- 

hy intensity of a label in ..i, ■ 

a label in the primer extension 

products. 



wo 96/38568 



PCT/US5W06906 



11 



it is co„™o„ electrophoresis to 

£U.ht ^ss speotro».try, electron microscopy a„a 

™' --""on also features en euto»»ted Dm 
sequencrng apparatus having a reactor incl ^- 
10 Which provide at 1. . "><=ludrn9 reagents 

fro™ a si„r ■ °' °™ '"^^ 

trora a single pramer and a DNA shv^n^ „ . 

Of a series diff • ^''^ ™^ 

ser.es differs m molecular weight and has a chain 
terminating agent at one end Th. 

BNA polymerase of th '""'"^^ ^ 

ymerase of the present invention Th« 
15 includes a separating apparatus 
separating means for separating the DNA 
product along one axis of th. 

Of bands It al '° ^ 

deter!- ■ ° "^'"'^^ ^ ^^^^ng means for 

d rmining the position and intensity of each hand 

- tlT, t:r"°" - ^ computing means 

th t d ermines the ... sequence of the n.. strand 
solely f.om the position and intensity of the hands 
along the axis and not fr-om 

Of liaht . wavelength of emission 

or light from any lah^i ^»,,^- 

ny label that may be present in the 
separating means. 

= other features and advantages of the Invention will 

-^e apparent fro„ the foUowing description of the 
preferred e^odi„ents thereof, and fro„ the claims 



The drawings win first briefly he described. 
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Figs 1-4 are the DNA sequences 
^ cyuences, and corresponding 

ain.no acxd sequences, of ^2, r,3, and the DNA 

respectively. Figure s is the 0» se^ZTaT^ 
corresponding amino acid sequence of m. 

The following exa^les se^e to illustrate the 
polymerases of the Dj-f^^^r.^ • . 
10 present invention and their use in 

10 sequencing. 



Bacterial Strains 

coli stains, _ ^^^^ ^ 

>, rec«, „1A1. endiJ, thj-j, 

A (Hi) )] . + , 

PGR 

Reaction conditions based on the procedure of 
Barnes (91 Pror Man i 

as £01, ^'^^^-^'^tJ^-S'^^d.^ 22X6-2220, a„4) were 
« fo lows: 20- ..ici„e pHs.e. S5« KOAc, 200« dKTPs, 
XO. glyce.01, «so, 0.5« each p.i„,, 
-5 U „ot^. ,^„,„„ _ ^^J^^"' 

De V.„, ^^^^^ ^^^^^^ 

'^-'""S conditions were ,,-0 30s es-c 
lomos for e cycles; then u-c 30s, es-c 12»„03 for 0 

^ycies; then 94°C 30s ^;flor. 

940c ,n . ' ' then 

94 C 30s, 68OC 14m40s for 8 cycles. 
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Ji3 vit2-o mutagenesis 

-,„e digestions, piasnia 
preparations, and other in 

were per^o^ea uai„, Inl;; 
5 =1 M„, , ""'J^rd protocols (Saa*rook at 

Molecular Cloning 2nd Ed colrfc • 
1989). PCR ,=e. "'"^ ""'^"^ Press, 

iwnicn as codon 396 of FY2) 

0 1TT3T"" ^ --~oc„c™ 

re^io! o 's rrrr"° '° " - 

-te. Mutagenic oiilo' ' """^ 

aytinic oiigo primer 2 

OGACCCCGTAGTTG.^, (SEQ. XD NO , ) /""""""^"^ 
« 1178 to i?4. • , ^^"^ nucleotides 

to 1241 including an Nhel site and . ^ 

sequence ID. NO i . °* 

first 235 . • °f — Tag deleted for the 

^irst 235 amino acids nwpoco 

i.ol^ ^ " encoding DeltaTag 

polymerase (Barnes M-y 

- a^atic. 1^:^^^::^ 

amplify the identical PCR . ^l"ed to 

ciicicai PGR product. The Prp ^ 

l^..tea to Bs^v^hex ai.est.d p^.sa plas^ia to replace 
the corresponding fragment to create pWB253v / 

zTr °' -9":::" 

use .or transformation and induction o, protein 
e«Press.on, although any host strain carrying a lac 
repressor could .a substituted . o» ,e^e„cing verified 
^"e Phe to ^ Change in the coding region 
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yne Sites, was used with Pn? - 

= containing restriction site- „rt 

—on s„ o, se^.„„ " 
°>don and restri^.- "othionine start 

• . ^ primer pairs anH 

Plasmxd pWB253y a. tenplate. The PCR , 

digested with restrict,- 

n restriction enzymes Ndel and Konl anH 
ligated to Ndel/KnnT ^- ""^ 

"ei/Kpni digested vector dRE? ■ 

" --^...^^^..u^^ "-3-10., r:o" r- 

- P-.3„, pol^ara e .l^Z 
encoding FY2 polvmer-a«» PRE273Y 
"ram .HX- wl^; ?' Cells or 

"is an. an sl:^ :^'-"^ —nation „it. 

- oo^paraM. paL T"" 

rec- cj- strain . . ^^^"ed. Alternatively, any 

a= nallail:: a j;? " '"""^ — -CH 

ixxc acid to produce the polymerase. The 
-fences of both genes .ere verUied pRB2,3v 
> to produce a c- -. PRE273Y was found 

produce a single polypeptide band on SDS 

polyacrylamide ae]=! ,.r,i -i 

e gels, unlike PRE253Y or PRE236Y 

- P.odnct .ntrod inTs rrd""^ "^^ ' 

VY2 Th. ' changes In oodon usage of 

""h Kdel/aa^I and 



wo 96/3SS6S 



^CT/X}S96m906 



15 



U.«ea to a pRH. construct co„tai.i„, the 3- end o, PV. 
o ceate pHKrv.p..., „^ 

NO. 9) and pramer 4 were used to make a PCR 

j-gesced fragment was used as 
a^ove to create pREPV3, encoding PV3 o,, 

10 Bacterial Strains 

^. coii strains: DHIV f^,^,,, 

CI857, A (HI))). 



PCR 



C.no„lc D» „a. prepared by standard lechni^e. 
fro. t..^,,,„,. ^ ^ ^ 

20 r 3™°"!= ■»« digested prior 

- PC „ith M«x, and .ra^ents appro^i^tel^ J 
were selected agarose ge. eiectropHoresis to .e used 

10». Trxs pHe.3, so« KCl, soo,M dm^.s, 0.00a. gelatin 
1.0« each pri.er. ..5« m^cI,, ^.s „ Tth, 0.025 „ ' 
Deepvent ,Hew .„,la„d Biol..s, , per .00,1 reaction 
Cycling conditions were S4»c 2 ™in, then 35 cycles of 
30s, .55-C 30s, 72=c 3 ™in, followed by ,2 -c for 7 
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In vitro mutagenesis 

Restriction enzyme digestions, plasmid 
preparations, and other in Wt.o manipulations of 
were performed using standard protocols (Sa^roo. et 
al., . p.as.id pMH. was constructed to encode an 

exonuciease-free poi„, 

expressaon in coii at the 5' end. Primer S (SHQ XB 

used With Printer 9 (SEQ. id. ii, 
10 (GGGGTACC^CCCTTOOCGCAAAGCCAGTC) to create a PCR 

product fro. .t. genomic U.., which was digested with 
restractxon enzy.es Ndel and Kpnl and inserted into 
plasmid PRE2 (Reddi et al igsQ m..^-i • 
X7, 10473 - 10488, digested with the same enzymes. 

TO create the desired mutation, two PCR 

products were made from Tt, chromosomal m^. primer 8 
above was used in combination with Primer lo (SKQ ID 
NO. 12, (OOGATGGCTAGCTCCTGGGAGAGCCTATGGGCGGACAT 
C.CCGTAGAGGACGCCGXAG..CACCO, to create a portion of the 

a sxlent change to create an Nhel restriction site 
Prxmer ll (SEQ. id. no. 13, (CTAGCTAGCCATCCCCTA 
CGAAGAAGCGGTGGCCT, was used in combination with primer S 
above to create a portion of the gene from the 
2S introduced Nhel site to the stop codon at the 3' end of 
the coding sequence. The PCR product of Primers 8 and 
10 was digested with Ndel and Nhel. and the PCR product ' 
Of Primers 9 and 11 was digested with Nhel and Kpnl 

30 waTl-'^'^ vector PRE2 which 

was d.gested with Ndel and Kpnl to produce plasmid pMRS 
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in addition to the desired changes, p«Rs was found to 
have a spurious change introduced by „M<:h led to 

c~t'°'r"'""""°"- ^^^^^^ 
= B.Lz?"""° -stitution, repiacin, the 

Atiii/BamHI fragment of pMRs for th» 

f ror the corresponding 
tragment from pMRi. The vva 

^''^ polymerase encoded by 
Plas„.d PM.S ,SEO. 10. »0. i„ ^^^J ^ 

trans^'V' ""^ ^« P-^-^V 

transformation, and strain ,Xcxas„ „3s used for 

prote„ e:^ressio„, although any oo„paraMe pair of . 

uT'i- T''" ""^'"^ -"^l" -uid be 

utrUzed. Mtematively, any 

T by che^oal agents such as nalidixic acid to 
produce the polymerase. 
15 Protean Sequencing 

determinations of amino terminal protein se,^e„ces 
were performed at the „.„. Kec. Foundation 
Biotechnology Resource Laboratory, New Haven, 
Connecticut. 

Purification of Polymerases 

Pact *v'"'"" """" °' ^•"-'^"ne, i. 

Pacto-yeast Bxtract, „.s. „aCl, . ^^amino *cids . 

20 « KPO. p„ ,.5 . so ,g/ml *.pioilu„ 

with a glycerol stoc. of the appropriate cell strain and 
.ro™ at 30^ „ith agitation until cells were in log 
Phase ,o.,...„. oo„., . , u,,„ ^ ^ ^ 

Acrds . 20 » KPO, p„ 7.5 . o.osv Mazu Snti-foam was 
rnoculated with 1 liter of log phase cells in lo liter 
mcroferm Fermentors (New Brunswick Scientific Co , 
30 cells were grown at ,0=C under IS psi pressure, 3S0-4S0 
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rpm agitation, and an air flow rate of 14,000 cc/min 
±1000 cc/min. When the OD590 reached 1.5-2.0, the 
cultures were induced by increasing the temperature to 
40-42°C for 90-120 minutes. The cultures were then 
5 cooled to < 20°C and the cells harvested by 

centrifugation in a Sorvall RC-3B centrifuge at 5000 rpm 
at 4'»C for 15-20 minutes. Harvested cells were stored 
at -80°C. 

Frozen cells were broken into small pieces and 
10 resuspended in pre-warmed (90-95°C) Lysis Buffer (20 mM 
Tris pH 8.5, 1 mM EDTA, 10 mM MgCl^, 16 mM (NHJ^SO,, 0.1% 
Tween 20, 0.1% Nonidet P-40, 1 mM PMSF) . Resuspended 
cells were then heated rapidly to SO^C and incubated at 
80^C for 20 minutes with constant stirring. The 
15 suspension was then rapidly cooled on ice. The cell 

debris was removed by centrifugation using a Sorvall GSA 
rotor at 10,000 rpm for 20 minutes at 4*^0. The NaCl 
concentration of the supernatant was adjusted to 3 00 mM. 
The sample was then passed through a diethylaminoethyl 
20 cellulose (Whatman DE-52) column that had been 

previously equilibrated with Buffer A (20 mM Tris pH 
8.5, 1 mM EDTA, 0.1% Tween 20, 0.1% Nonidet P-40, 300 mM 
NaCl, 10% glycerol, 1 mM DTT) , and polymerase collected 
in the flow through. The sample was then diluted to a 
25 concentration of NaCl of lOOmM and applied to a Heparin- 
sepharose column. The polymerase was eluted from the 
column with a NaCl gradient (100-500 mM NaCl) . The 
sample was then dialyzed against Buffer B (20 mM Tris pH 
8.5, 1 mM EDTA, 0.1% Tween 20, 0.1% Nonidet P-40, 10 mM 
30 KCl, 10% glycerol, 1 mM DTT) and further diluted as 
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needed to lower the conductivity of the sample to the 
conductivity of Buffer B. The sample was then applied 
to a diethylaminoethyl (Waters DEAE 15 HR) column and 
eluted with a 10-500 mM KCl gradient. The polymerase 
5 was then diluted with an equal volume of Final Buffer 
(20 mM Tris pH 8.5, 0.1 mM EDTA, 0,5% Tween 20, 0.5% 
Nonidet P-40, 100 mM KCl, 50% glycerol, 1 mM DTT) and 
dialyzed against Final Buffer. 

Assay of Exonuclease Activity 

10 The exonuclease assay was performed by incubating 5 

ul (25-150 units) of DNA polymerase with 5 ug of 
labelled PH] -pBR322 PCR fragment (1.6x10* cpm/ug DNA) in 
100 ul of reaction buffer of 20 mM TrisHCl pH 8.5, 5 mM 
MgCl2, 10 mM KCl, for 1 hour at 60 ^C. After this time 

15 interval, 200 ul of 1:1 ratio of 50 ug/ml salmon sperm 
DNA with 2 mM EDTA and 20% TCA with 2% sodium 
pyrophosphate were added into the assay aliquots. The 
aliquots were put on ice for 10 rain and then centrifuged 
at 12,000g for 10 min. Acid-soluble radioactivity in 200 

20 ul of the supernatant was quantitated by liquid 

scintillation counting. One unit of exonuclease activity 
was defined as the amount of enzyme that catalyzed the 
acid solubilization of 10 nmol of total nucleotide in 30 
min at 60 "C. 



25 Utility in DNA Sequencing 

Example 1: DNA Sequencing with FY Polymerases (e.a 

FY2 and FY3) 
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The following components were added to a 
microcentrifuge vial (0.5 ml) : 0.4 pmol M13 DNA (e.g./ 
Ma3mpl8, 1.0 ^g); 2 fxl Reaction Buffer ( 260 mM Tris- 
HCl. pH 9.5 65 mM MgCl,) ; 2 ^1 of labeling nucleotide 
5 mixture (1.5 each of dGTP, dCTP and dTTP) ; 0.5 ^1 (5 
MCi) of [a-»p]dATP (about 2000Ci/mmol) ; 1^1 .40 primer 
(0.5 MM; 0.5 pmol/^l 5 'GTTTTCCCAGTCACGAC-a ' ) ; 2 /il of a 
mixture containing 4 U/a/1 FY polymerase and 6.6 U/ml 
Thermoplasma acidophilum inorganic pyrophophatase 
10 (32 U/Ml polymerase and 53 U/ml pyrophosphatase in 20 mM 
Tris (pH8.5) , 100 mM KCl, 0.1 mM EDTA, 1 mM DTT, 0.5% 
NP-40, 0.5% TWEEN-20 and 50% glycerol, diluted 8 fold in 
dilution buffer (10 mM Tris-HCl pH8.0, 1 mM 2- 
mercaptoethanol, 0.5% TWEBN-20, 0.5% NP-40)); and water 
15 to a total volume of 17.5 /.I. These components (the 

labeling reaction) were mixed and the vial was placed in 
a constant -temperature water bath at 45°C for 5 minutes. 

Pour vials were labeled A, c, G, and T, and filled 
with 4 fxl of the corresponding termination mix: ddA 
20 termination mix (150 //M each dATP, dCTP, dGTP, dTTP, 1.5 
m ddATP) ; ddT termination mix (150 fiM each dATP, dCTP, 
dGTP, dTTP, 1.5 MM ddTTP); ddC termination mix (150 f.M 
each dATP, dCTP, dGTP, dTTP, 1.5 /.M ddCTP) ; ddG 
termination mix (150 f^M each dATP, dCTP, dGTP, dTTP, 1.5 
25 fM ddGTP) . 

The labeling reaction was divided equally among the 
four termination vials (4 fil to each termination 
reaction vial), and tightly capped. 

The four vials were placed in a constant- 
30 temperature water bath at 72''C for 5 minutes. Then 4 ^1 
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of stop solution (95% Formamide 20 EDTA. o 05. 
Bromophenol Blue 0 n^^r v,,i 

vial and . . ^^^^^ --h 

vial, and heated briefly to lO'-an-n ^ ^■ 
T ,. y to /o -80 c immediately prior to 

loading on a sequencing gel (8% 
e „ . "9 gel (Sir acrylamide, 8.3 M urea) 

.u o ^^ ^^ . 

poly„,«ase or Aira, DBA pol5™„a.e. 

The following co„^,one„ts were added to a 

microcentrifuQe vial c iv , . 

. ^ . ^ "^"^ "-^^ ^hich which is suitable 

insertion into a thei^ocycler machine pe^ki, 

15 Elmer DNA Thermal Cycler) -on. . " 

^ycier) . o.05 pmol or more M13 DNA 
le-^, Ml3mpl8, 0.1 ua) or- n -. 

-i^ A*g J, or 0.1 ;zg double- stranded 

a™,- 0.5 .1 ,s ,ci, of ,„-»p,dATP (about 
4 O/;.! p. polymerase .„d s.. c/„l r*er«,p.aa.a 

rrrjT p-~tase ,,3 „/„i poi„„„, 

and 50% glycerol, diluted 8 fold 1„ dilution buffer ,10 

Tris-„cl.pH3.o, 1 « .-„er„ptoethanol, 0.5. „h™- 
20, 0.5* NP-40,,,. and water to a total volume of 1, 5 
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.i..dtr T""" "^"'"^ 

(Amersham) . The vi;:.! 

l^en^rature, „„ be cycled f™, s^'-.s-c , , 

10 temperatures work well fn. 

- -t pri.er.e.plates co^i„..,_. ^ J 

. -pleted in about 3 .^^,,33 per cycle. 

Pour vials were labeled A c G ...h ^ 
v-ith 4 ml of th. • ^^^^^d 

ml Of the corresponding tennination mix: ddA 
termxnatxon mix (150 each dATP dCTP dnrp . 

^if) , ddT termination mix M >5n „m 
dGTP, dTTP 1 s ,M (150 each dATP, dCTP. 

' cJTTP, X.5 uM ddTTP); ddC termination mix {150 
each dATP. dCTP, dGTP dTTP n c ^ 

' 1-5 ^ ddCTP); ddG 

termination mix (xso each dATP, dCTP, dGTP dTTP . . 
/^MddGTP). No addif,-^ ■> ' ^"^ 

No additional enzyme is added to the 
^0 termination vials tk^ 

The cyclea l,bcu„g faction „i.t„„ waa aivlaed 

-a=Uc„ ..al>, ana ove.U.a „U. „ " 
25 light mineral oil. ^ 

30 00 cycle. ,„c,e than .„ cycle, le u^eceeaa^, 
performed from SSV f or- i ^ 

120 aeconas. Thi, etep vas cc„ve„ie„tly 
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completed overnidht m-hc.^ • ■ ' ^ ' 

ignc. other times and temperatures are 

also effective. 

Six Ml Of reaction mixture was removed (avoiding 
oxl). 3 Ml Of stop solution (95% Formamide 20 mM EDTA 
S 0.05% Brcmophenol Blue. 0.05% Xylene Cyanol PP) added' 
and heated briefly to 70--80°C inunediately prior to 
loading on a sequencing gel. Autoradiograms rec^ired an 
18-36 hour exposure using Koda. XAR-5 fil. or Amersham 
HyperfilmMP. High-c^uality sequence results with 
10 -iform band intensities were obtained. The band 

xntensities were much more uniforn, than those obtained 
wxth Similar protocols using rag DNA polymerase or APag 
DHA polymerase. 

For either of the sequencing methods outlined in 
examples . and 2, 7-Oeaza-2 -deoxy-CTP can be substituted 
for dGTP in the labeling and termination mixtures at 
exactly the same concentration as dGTP. when this 
20 substitution is made, secondary structures on the gels 
are greatly reduced. Similarly, 2 • - 

deoxyinosinetriphosphate can also be substituted for 
dGTP but its concentration must be lO-fold higher than 
the corresponding concentration of dGTP. Substitution 
25 Of dlTP for dGTP is even more effective in eliminating 
compression artifacts than 7-deaza-dGTP. 
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gX^mpXg 4; Other Sequencing methods using FY 

DolvmeraRPfi 

FY polymerases have been adapted for use with many 
other sequencing methods, including the use of 
5 fluorescent primers and fluorescent-dideoxy- terminators 
for sequencing with the ABI 373A DNA sequencing 
instrument . 



gX^mplg ^; SPS-Polvacrvlamid e Gel Electrophores:^^ 

Protein samples were run on a 14 X 16 mm 7.5 or 10% 

10 polyacrylamide gel. (Gels were predominantly 10% 
Polyacrylamide using a 14 X 16 mm Hoefer apparatus. 
Other sizes, apparatuses, and percentage gels are 
acceptable. Similar results can also be obtained using 
the Pharmacia Phast Gel system with SDS, 8-25% gradient 

15 gels. Reagent grade and ultrapure grade reagents were 
used.) The stacking gel consisted of 4% acrylamide 
(30:0.8, acrylamide: bisacrylamide) , 125 mM Tris-HCl pH 
6.8, 0.1% Sodium Dodecyl Sulfate (SDS) . The resolving 
gel consisted of 7.5 or 10% acrylamide {30:0.8, 

20 acrylamide: bisacrylamide), 375 mM Tris-HCl pH 8.8, 0.1% 
SDS. Running Buffer consisted of 25 mM Tris, 192 mM 
Glycine and 0.1% SDS. IX Sample Buffer consisted of 25 
mM Tris-HCl pH 6.8, 0.25% SDS, 10% Glycerol, O.IM 
Dithiothreitol, 0.1% Bromophenol Blue, and ImM EDTA, A 

25 1/4 volume of 5X Sample Buffer was added to each sample. 
Samples were heated in sample buffer to 90-100°C for 
approximately 5 minutes prior to loading. A 1.5 mm 
thick gel was run at 50-100 mA constant current for 1-3 
hours (until bromophenol blue was close to the bottom of 
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the gel). The gel was stained with 0.025% Coomassie 
Blue R250 in 50% methanol, 10% acetic acid and destained 
in 5% methanol, 7% acetic acid solution. A record of 
the gel was made by taking a photograph of the gel, by 
drying the gel between cellulose film sheets, or by 
drying the gel onto filter paper under a vacuum. 

Other embodiments are within the following claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 
(i) APPLICANT: 



AMERSHAM LIFE SCIENCE 



(ii) TITLE OF INVENTION: 



THERMOSTABLE DNA 
POLYMERASES 



(iii) NUMBER OF SEQUENCES: 



14 



(iv) CORRESPONDENCE ADDRESS: 



10 



(A) 
(B) 

(C) 
(D) 
(E) 
(F) 



ADDRESSEE ; 
STREET : 

CITY: 
STATE : 
COUNTRY : 
ZIP: 



Lyon & Lyon 

633 West Fifth Street 

Suite 4700 

Los Angeles 

California 

U.S.A. 

90071-2066 



3.5" Diskette, 1.44 Mb 
storage 

IBM Compatible 
IBM P.C. DOS 5.0 
Word Perfect 5.1 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To Be Assigned 

(B) FILING DATE: 

(C) CLASSIFICATION: 

25 (vii) PRIOR APPLICATION DATA: 



15 (v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 

(B) COMPUTER: 

(C) OPERATING SYSTEM: 
20 (D) SOFTWARE: 
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Prior applications total, 
including application 
described below: one 

(A) APPLICATION NUMBER: US 08/455, 68G 

(B) FILING DATE: May 31, 1995 

(viii) ATTORNEY/AGENT INFORMATION: 

Warburg, Richard J. 

(B) REGISTRATION NUMBER: 32,327 

(C) REFERENCE/DOCKET NUMBER: 219/3 04 -PCT 



10 (ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (213) 489-1600 

(B) TELEFAX: (213) 955-0440 

(C) TELEX: 67-3510 



(2) INFORMATION FOR SEQ ID NO: 1: 
15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1686 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (ix) FEATURE: 

(A) NAME/KEY: FY2 

(B) LOCATION: 1...1683 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

ATG CTG GAG AGG CTT GAG TTT GGC AGO CTC CTC CAC GAG TTC GGC CT^* 48 
25 Met Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
1 5 10 15 

CTG GAA AGO CCC AAG GCC CTG GAG GAG GCC CCC TGG CCC CCG CCG GAA 96 
Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
20 25 30 

30 GGG GCC TTC GTG GGC TTT GTG CTT TCC CGC AAG GAG CCC ATG TGG GCC 144 
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Gly Ala Phe Val Qly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
35 40 45 

GAT CTT CTG GCC CTQ GCC GCC GCC AGG GGG GGC CGG GTC CAC CGG GCC 
Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
.55 60 

CCC GAG CCT TAT AAA GCC CTC AGG 6AC CTG AAG GAG GCG CGG GGG CIT 
Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
^^ 70 75 eo 

CTC GCC AAA GAC CTG AGC GTT CTG GCC CTG AGG GAA GGC CTT GGC CTC 
Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
85 90 

CCG CCC GGC GAC GAC CCC ATG CTC CTC GCC TAC CTC CTG GAC CCT TCC 
Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
=^00 105 110 

15 AAC ACC ACC CCC GAG GGG GTG GCC CGG CGC TAC GGC GGG GAG TGG ACG 
Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
115 120 125 

GAG GAG GCG GGG GAG CGG GCC GCC CTT TCC GAG AGG CTC TTC GCC AAC 
Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
2^ 135 140 

CTG TGG GGG AGG CTT GAG GGG GAG GAG AGG CTC CTT TGG CTT TAC CGG 
Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
' 150 155 160 



192 



240 



288 



336 



384 



432 



480 



GAG GTG GAG AGG CCC CTT TCC GCT GTC CTG GCC CAC ATG GAG GCC ACG 
^5 Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
165 170 175 



30 GCC GAG GAG ATC GCC CGC CTC GAG GCC GAG GTC TTC CGC CTG GCC GGC 
Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
195 200 205 



528 



GGG GTG CGC CTG GAC GTG GCC TAT CTC AGG GCC TTG TCC CTG GAG GTG 576 
Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
180 185 ;^3Q 



624 



CAC GCC TTC AAC CTC AAC TCC CGG GAC CAG CTG GAA AGG GTC CTC TTT 672 
His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
35 210 215 220 

GAC GAG CTA GGG CTT CCC GCC ATC GGC AAG ACG GAG AAG ACC GGC AAG 720 
Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Qly Lys 
225 230 235 240 
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250 255 

265 270 

^"^^ ™ CCG GAC CTC ATC CAC CCC AGG ACG GGC CGC 
Thr Tyr lie A.p Pro Leu Pro Asp I^u He His Pro Arg Thr Gly 

2B0 285 

S S ^ *CG GGC AGG CTA AGT 

Leu H.S Thr Arg Phe Asn Gin Thr Ala Ihr Ala Thr Gly Arg Leu Ser 

300 

AGC TCC GAT CCC AAC CTC CAG AAC ATC CCC GTC CGC ACC CCG CTT 6GG 

15 III '''^ '''' Z Z Ty 

315 

if f° ^^'^ ^'"^ TGG CTA ITG GTC 

Gin Arg lie Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu 



10 



768 



816 



864 



912 



960 



1008 



325 330 



335 



20 t^"" 7° f GTG CTC GCC CAC CTC TCC 1056 
20 Ala Leu Asp ^ Ser Gin He Glu Leu Arg Val Leu Ala Ss sir 



345 



350 



GGC 6AC GAG AAC CTC ATC CGG GTC TTC CAG GAG GGG CGG GAC ATC CAC 
Gly ASP Glu A.n Leu He Arg Val Phe Gin Glu Gly Arg Asp He 

360 3g5 



25 



111 '''^ '^^^ G<^C GTC GAC 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 



1104 



1152 



^ 375 380 
CCC CTC ATC CGC CGG GCG GCC AAG ACC ATC AAC TAC GGG GTC CTC TAC 

30 385 ^ <'ly 

"0 335 

GGC ATG TC6 GCC CAC CGC CTC TCC CAG GAG CTA GCC ATC CCT TAC GAG 
Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro T^ ^ 
.^05 410 

1^ ^^'^ *" ^ ™^ ""^ AGC ITC CCC AAG GTG 1296 

35 Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro ^s vlt 
420 425 430 



1200 



1248 



CGG GCC TCG ATT GAG AAG ACC CTC GAG GAG GGC AGG AGG CGG GGG TAC 1344 
Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
435 440 445 
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6TG GAG ACC CTC TTC GGC CGC CGC CGC TAC 6TG CCA GAC. CTA QAO GCC 
Val Giu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 

455 460 

CGG GTG AAG AGC GTG CGG GAG GCG GCC GAG CGC ATO GCC rrc AAC ATG 
Arg Val Lya Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
470 475 

CCC GTC CAG GGC ACC GCC GCC GAC CTC ATG AAG CTC GCT ATG GTG AAG 
Pro val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala «It W^l 
485 



490 



495 



10 CTC TTC CCC AGG CTG GAG GAA ATG GGG GCC AGG ATG CTC CTT CAG GTC 
Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 



500 

5^ ?:^? AAA GAG AGG GCG GAG GCC GTG 

Ala Pro Lys Glu Arg Ala 
520 525 

GCC CGG CTG GCC AAG GAG GTC ATG GAG GGG GTG TAT CCC CTG GCC GTG 
Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 

535 540 



505 510 

u,o , , --- CCA AAA GAG AGG GCG GAG uut iiiw 

His Asp Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 

X3 515 



1392 



1440 



1488 



1536 



1584 



1632 



CCC CTG GAG GTG GAG GTG GGG ATA GGG GAG GAC TGG CTC TCG GCC AAG 1680 
20 Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 

550 555 560 



GAG TGA 
Glu ♦ 



1686 



25 



30 



(2) INFORMATION FOR SEQ ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 

{B) TYPE: 

(C) STRANDEDNESS : 

(D) TOPOLOGY: 



1689 base pairs 
nucleic acid 
single 
linear 



(ix) FEATURE; 



(A) NAME/KEY: FY3 

(B) LOCATION: 1. . .1686 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
ATG GCT CTG GAA CGT CTG GAG TTT GGC AGC CTC CTC CAC GAG TTC GGC 



48 
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Met Ala Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His .Glu Phe Gly 
1 5 10 15 

CTT CTG GAA AGC CCC AAG GCC CTG GAG GAG GCC CCC TGG CCC CCG CCG 96 
Leu Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro 
^ 20 25 30 

GAA GGG GCC TTC GTG GGC TTT GTG CTT TCC CGC AAG OAG CCC ATG TGG 144 
Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp 
35 40 45 

GCC GAT CTT CTG GCC CTG GCC GCC GCC AGG GGG GGC CGG GTC CAC CGG 192 
10 Ala Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Aro 
50 55 60 

GCC CCC GAG CCT TAT AAA GCC CTC AGG GAC CTG AAG GAG GCG CGG GGG 24 0 
Ala Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly 
70 75 80 

15 CTT CTC GCC AAA GAC CTG AGC GTT CTG GCC CTG AGG GAA GGC CTT GGC 288 
Leu Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly 
85 90 95 

CTC CCG CCC GGC GAC GAC CCC ATG CTC CTC GCC TAC CTC CTG GAC CCT 336 
Leu Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro 
20 100 105 110 

TCC AAC ACC ACC CCC GAG GGG GTG GCC CGG CGC TAC GGC GGG GAG TGG 384 
Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp 
115 120 125 

ACG GAG GAG GCG GGG GAG CGG GCC GCC CTT TCC GAG AGG CTC TTC GCC 432 
25 Thr Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala 
130 135 140 

AAC CTG TGG GGG AGG CTT GAG GGG GAG GAG AGG CTC CTT TGG CTT TAC 480 
Asn Leu Trp Gly Tlrg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr 
150 155 160 

30 CGG GAG GTG GAG AGG CCC CTT TCC GCT GTC CTG GCC CAC ATG GAG GCC 528 
Arg Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala 
165 170 175 

ACG GGG GTG CGC CTG GAC GTG GCC TAT CTC AGG GCC TTG TCC CTG GAG 576 
Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu * 
35 180 185 190 

GTG GCC GAG GAG ATC GCC CGC CTC GAG GCC GAG GTC TTC CGC CTG GCC 624 
Val Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala 
195 200 205 

GGC CAC CCC TTC AAC CTC AAC TCC CGG GAC CAG CTG GAA AGG GTC CTC 672 
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15 



Gly His Pro Phe Asn^Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 

215 220 

TTT GAC GAG CTA GGG CTT CCC GCC ATC GGC AAG ACG GAG AAG ACC GGC 
Phe ASP Glu Leu Gly Leu Pro Ala lie Gly. Ly. Thr Glu Lys Th^ Gly 
230 235 240 

AAG CGC TCC ACC AGC GCC GCC GTC CTG GAG GCC CTC CGC GAG GCC CAC 
Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu A^a 

250 

CCC ATC GTG GAG AAG ATC CTG CAG TAC CGG GAG CTC ACC AAG CTO AAG 
Pro Xle val Glu Lys He Leu Gin Tyr Arg Glu Leu T^^ 

265 270 

?S Sr zZ"^"" p""^ ™^ CCG GAC CTC ATC CAC CCC AGG ACG GGC 
ser Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly 

280 285 

Sa t^"" """^ ^™ '^^^ ^ ACG GGC AGG CTA 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg "u 

200 



10 



720 



766 



B16 



864 



912 



20 



AGT AGC TCC GAT CCC AAC CTC CAG AAC ATC CCC GTC CGC ACC CCG CTT 9^0 
ser ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg So SI 



30 



GGG CAG AGG ATC CGC CGG GCC TTC ATC GCC GAG GAG GGG TGG CTA TTG 
Gly Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly 1^ S "u 
325 



1008 



335 



25 t^'' r° ""^ ^''^ ^^^^ CTG GCC CAC CTC 1056 

25 val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His 2u 



340 



350 



TCC GGC GAC GAG AAC CTG ATC CGG GTC TTC CAG GAG GGG CGG GAC ATC 
ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly S| Sp zlt 

355 3S0 365 



1104 



CAC ACG GAG ACC GCC AGC TGG ATG TTC GGC GTC CCC CGG GAG GCC GTG 1152 
Hxs Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu aS vTl 

375 380 



35 3B5 



GAC CCC CTG ATG CGC CGG GCG GCC AAG ACC ATC AAC TAC GGG GTC CTC 
Asp Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Tyr Gly Val Leu 

3B5 390 



1200 



395 



400 



TAC GGC ATG TCG GCC CAC CGC CTC TCC CAG GAG CTA GCC ATC CCT TAC 1248 
Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro T^ 
405 410 415 
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460 

480 

■^5 -ftsP Leu Met Lye Leu Ala Met Val 

"° 495 

s s s - s s J'l r 

500 Z ''^^ Met Leu Leu Gin 

^ 510 

515 I-ys Glu Arg Ala Glu Ala 

" 525 



560 
AAG GAG TGA 

i-ys Glu ♦ 



1689 



30 



35 



(2) INFORMATION FOR SEQ 10 NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: Kr>o^ • 

(B) TYPE: ' ^^^^ ^^^^^^ 

nucleac acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: li^e^^ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
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GCTTGGGCAG AGGATCCGCC GGG 



(2) INFORMATION FOR SEQ ID NO: 4; 
(i) SEQUENCE CHARACTERISTICS: 



15 



20 



25 



(A) LENGTH: k 

5 (B) TYPE: nL^^^"^^" 

/o\ „ nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: u^e^^ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
10 C?caS L^^^^^" CCTAOAGC.C 

(2) INFORMATION FOR SEQ ID NO: 5 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 3i 

(B) TYPE- 

nucleic acid 
C STRANDEDNESS: single 
(D) TOPOLOGY: Unear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GGAATTCCAT ATGGACGATC TGAAGCTCTC C 

(2) INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: ti u . 

(B) TYPE- P^^""^ 
1^ lYPE. nucleic acid 

(C) STRANDEDNESS: single 
(DJ TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
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GGGGTACCAA GCTTCACTCC TTGGCGGAGA G 



(2) INFORMATION FOR SEQ ID NO: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH : 

(B) TYPE: 

(C) STRANDEDNESS : 

(D) TOPOLOGY: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GGAATTCCAT ATGCTGGAGA GGCTTGAGTT T 



31 base pairs 
nucleic acid 
single 
linear 



10 (2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

TYPE: nucleic acid 

(C) STRANDEDNESS : single 

^5 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GGAATTCCAT ATGCTGGAAC GTCTGGAGTT TGGCAGCCTC CTC 



(2) INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS: 

LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9; 
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GGAATTCCAT. ATGGCTCTGG AACGTCTGGA GTTnSGCAGC CTCCTC 

(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: An K=o« 

5 (B) TYPE. P^"^ 

/ lift.. nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GGAATTCCAT ATGCTGGAAC GTCTGGAATT CGGCAGCCTC 

10 (2) INFORMATION FOR SEQ ID NO: H: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH- io w 

(B) TYPE P^^^^ 
» / IX fb.. nucleic acid 

^> STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGGGTACCCT AACCCTTGGC GGAAAGCCAG TC 

(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

20 (A) LENGTH: S4 base pairs 

II TYPE: nu^ieie acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
25 GGGATGGCTA GCTCCTGGGA GAGCCTATGG GCGGACATGC CGTAGAGGAC 50 



46 



40 



32 
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GCCGTAGTTC ACCG 

INFORMATION POR SEQ ID NO: ,3^ 

SEQUENCE CHARACTERISTICS: 

5 (A) LENGTH: ,^ ^ 

(B) TYPE- pairs 
<C) STRANDEDNESS- """^^^^^ ^"cl 

TOPOLOGY '""^^^ 

-linear 

^^^^ 

'° TOR SEC ir, 

'i) SEOTOBCE OaRSCTERISTICS: 
f A) LENGTH - 

(B) TYPE- ^^^^ ^^se pairs 

15 STRANDEDNESS: J-^'r" ""'^ 

(D) TOPOLOGY- f "^^^ 

linear 

(ix) FEATURE: 

(A) NAME/KEY- FY4 

(B) LOCATION: l...ie83 

CTG GAG GCC CCC rnr^ r^r. 

4 5 



35 



48 



96 



144 
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G 

125 

His Met Glu Ala Thr 
175 



192 



240 



286 



336 



384 



432 



480 



528 



25 



GGG GTA CGG CTG GAC GTV '^''^ 

°" - - - »^ S S IS s - s s - - 

185 , ^J-U I^u 

GCG GAG GAG ATC rrn /^^« 
, Ala Glu Glu lit ^ f GAG GAG GTC TTC C^r ^ 
30 Arg Arg x,eu Glu Glu Glu Val ^ 

200 Ala Gly 

2 05 

220 

230 ^ ^ Thr Gly Lys 

- E S S S s S S S S S - T - - - 

2« ^0 ^9 Glu Ala His Pro 

4 0 ATC GTG GAG AAG ATC CTC r^n 

C CTC CAG CAC CGG GAG CTC ACC AAG CXC AAG AAC 



576 



624 



672 



720 



768 



816 
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lie val Glu Lys He Leu Gin His tog Glu Leu Thr Lys Leu Lys Asn 
"° 265 270 

?2 r CGC 

Thr Tyr val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arq 

"5 280 285 

CTC CAC ACC CGC TTC AAC CAG ACG GCC ACG GCC ACG GGG AGQ CTT AGT 
Leu Hxs Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 

290 295 



300 



310 



320 

CAG A(3G ATC CGC CGG GCC TTC GTG GCC GAG GCG GGT TCG 6CG TTG GTG 
Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
325 

15 GCC CTG GAC TAT AGC CAG ATA GAG CTC CGC GTC CTC GCC CAC CTC TCC 
Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 

345 

GGG GAC GAA AAC CTG ATC AGG GTC TTC CAG GAG GGG AAG GAC ATC CAC 
Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His 
355. 360 

ACC CAG ACC GCA AGC TGG ATG TTC GGC GTC CCC CCG GAG GCC GTG GAC 
Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 

375 380 



864 



912 



10 ser ser 21 p""^ f ^ "^^^ ™= ^<^C 

III Asn Leu Gin Aen He Pro Val Arg Thr Pro Leu Gly 



1008 



1056 



1104 



1152 



25 Pro I M ^ ? ''''^ """^ ^™ ™^ °TC CTC TAC 1200 

25 Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Tyr Gly Val Leu Tyr 

390 395 400 

GGC ATG TCC GCC CAT AGG CTC TCC CAG GAG CTA GCC ATC CCC TAC GAA 
Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 

410 4-^5 

30 GAA GCG GTG GCC TTT ATA GAG CGC TAC TTC CAA AGC TTC CCC AAG GTG 
Glu Ala Val Ala Phe He Qlu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
420 425 430 

CGG GCC TGG ATA GAA AAG ACC CTG GAG GAG GGG AGO AAG CGG GGC TAC 
Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
435 440 445 



1248 



1296 



1344 



40 



GTG GAA ACC CTC TTC GGA AGA AGG CGC TAC GTG CCC GAC CTC AAC GCC 1392 
Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
450 455 460 

COG GTG AAG AGC GTC AGG GAG GCC GCG GAG CGC ATG GCC TTC AAC ATG 144 0 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
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465 



40 



470 475 



4B0 



CAC GAC GAG CTC CTC CTG GAG GCC CCC CAA GCG COG GCC OAG GAG GTG 
His Asp Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 

GCG GCT TTG GCC AAG GAG GCC ATG GAG AAG GCC TAT CCC CTC GCC GTC 
Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
"° 535 540 



1488 



CCC GTC CAG GGC ACC GCC GCC GAC CTC ATC AAG CTC GCC ATG GTG AAG 
Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
485 495 

CTC TTC CCC CGC CTC CG3 GAG ATG GGG GCC CGC ATG CTC CTC CAG GTC 1536 
Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
500 



1584 



1632 



15 



CCC CTG GAG GTG GAG GTG GGG ATG GGG GAG GAC TGG CTT TCC GCC AAG 
Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
550 555 560 



1680 



GGT TAG 
Gly * 



1686 
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Claimg 

1. An enzyraatically active DNA polymerase having 
between 540 and 582 amino acids having a tyrosine at a 
position equivalent to position 667 of Tag DNA 
5 polymerase, wherein said polymerase lacks 5' to 3 » 

exonuclease activity, and wherein said polymerase has at 
least 95% homology in its amino acid sequence to the DNA 
polymerase of Thermug ^qyaticti.s, Therrm.. flavirc; or 
Th^rmu f i? thgrniophilvp. and wherein said polymerase forms 
10 a single polypeptide band or an SDS polyacrylamide gel. 

2. The polymerase of claim l wherein the amino 
acid sequence of said polymerase includes less than 3 
conservative amino acid changes compared to one said DNA 
polymerase of said named ThermuR species. 

^' The polymerase of claim 1 wherein the amino 
acid sequence of said polymerase includes less than 3 
additional amino acids compared to one said DNA 
polymerase of said named Ili££mu£ species at its N- 
terminus . 

^- The polymerase of claim 1 selected from the 
group consisting of FY2, FYS and FY4 . 



5, Purified nucleic acid encoding the DNA 
polymerase of any of claims 1-4. 
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6, Method for sequencing DNA comprising the step 
of generating chain terminated fragments from the DNA 
template to be sequenced with a DNA polymerase of any of 
claims 1-4 in the presence of at least one chain 
5 terminating agent and one or more nucleotide 

triphosphates, and determining the sequence of said DNA 
from the sizes of said fragments. 



7. Kit for sequencing 
polymerase of any of claims 

8. The kit of claim 7 
is thermostable. 



DNA comprising a DNA 

1-4 and a pyrophosphatase. 

wherein said pyrophosphatase 



9. Apparatus for DNA sequencing having a reactor 
comprising a DNA polymerase of any of claims 1-4 and a 
band separator. 
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WO-A-9405797 17-03-94 FI-A- 923911 02-03-94 

AU-B- 4960893 29-03-94 



WO-A-9012111 18-10-90 



AU-B- 


638246 


24-06-93 


AU-B- 


5438290 


05-11-90 


EP-A- 


0467953 


29-01-92 


JP-T- 


4506602 


22-10-92 


LT-A- 


1519 


26-06-95 


US-A- 


5498523 


12-03-96 



EP-A-0655506 


31-05-95 


AU-B- 


4193396 


06-05-96 






OE-U- 


29513622 


19-10-95 






OE-U- 


29513639 


19-10-95 






DE-T- 


655506 


28-09-95 






ES-T- 


2072238 


16-07-95 






WO-A- 


9612042 


25-04-96 



Fona PCr/ISA/210 (piUnt (unity uumji) (July 1993} 



